Linguistic features weighting for a text-to-speech system without prosody model

نویسندگان

Vincent Colotte

Richard Beaufort

چکیده

This paper presents a Non-Uniform Units selection-based TextTo-Speech synthesizer. Nowadays, systems use prosodic models that do not allow the prosody to vary as far as we should hope, involving a listening comfort degradation. Our system has the advantage to avoid the using of prosodic model. Speech units selection builds its features set exclusively from the linguistic information generated by the natural language analysis. We also present an original method to automatically weight these features. Therefore, selected units are not restricted by a predetermined prosody. With only using linguistic features, we obtain a various prosody and the units concatenation is performed without resort to heavy signal processing.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Linguistic features weighting for a without prosody

متن کامل

An efficient text analyzer with prosody generator-driven approach for Mandarin text-to-speech

A new approach for an efficient text analyser is proposed. The prosody generator-driven method is employed to design an efficient text analyser for Mandarin text-to-speech. More simple structure of text analysis, more suitable classification of linguistic features and more efficient contribution of linguistic features to the prosody generator can be achieved. Three heuristic and theoretical met...

متن کامل

Unsupervised prosody labeling for constructing Mandarin TTS

This paper introduces an unsupervised prosody labeling method for preparing a large speech corpus used in developing a Mandarin Text-to-Speech system. Adopting a four-layer prosody hierarchy, the proposed method performs an unsupervised segmental clustering that iteratively segments spoken utterances into strings of prosodic constituents and models the patterns of the segmented prosodic constit...

متن کامل

New rule-based and data-driven strategy to incorporate Fujisaki's F 0 model to a text-to-speech system in Castillian Spanish

We will present the analysis of a Spanish prosody database by estimating the parameters of Fujisaki's model for FO contours. These parameters are classified attending to linguistic features and they form the analysis database. When synthesizing FO contours we extract the linguistic features from the text and perform a k-Nearest Neighbour search. Linguistic feature comparison distance is trained...

متن کامل

MeLos: Analysis and Modelling of Speech Prosody and Speaking Style

This thesis addresses the issue of modelling speech prosody for speech synthesis, and presents MeLos: a complete system for the analysis and modelling of speech prosody “the music of speech”. Research into the analysis and modelling of speech prosody has increased dramatically in recent decades, and speech prosody has emerged as a crucial concern for speech synthesis. The issue of speech prosod...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Linguistic features weighting for a text-to-speech system without prosody model

نویسندگان

چکیده

منابع مشابه

Linguistic features weighting for a without prosody

An efficient text analyzer with prosody generator-driven approach for Mandarin text-to-speech

Unsupervised prosody labeling for constructing Mandarin TTS

New rule-based and data-driven strategy to incorporate Fujisaki's F 0 model to a text-to-speech system in Castillian Spanish

MeLos: Analysis and Modelling of Speech Prosody and Speaking Style

عنوان ژورنال:

اشتراک گذاری